Receptor tyrosine kinase with a discoidin-type binding domain

ABSTRACT

A breast carcinoma tyrosine phosphoprotein, DDR (Discoidin Domain Receptor), that defines a novel class of receptor tyrosine kinases is presented. The DDR cDNA predicts a polypeptide C-terminal tyrosine kinase domain and an N-terminal domain similar to the  Dictyostelium discoideum  lectin discoidin I. These domains are connected by an extraordinary hydrophilic proline/glycine-rich domain, which is interrupted by a predicted transmembrane sequence. This extended proline/glycine-rich region suggests an unusual geometry of interaction with ligand or substrates. Discoidin I-type domains may interact with specific cell surface molecules.

[0001] This invention was made with Government support under Grant No. DK37661, awarded by the National Institute of Health. The Government has certain rights in this invention.

BACKGROUND OF THE INVENTION

[0002] 1. Field of the Invention

[0003] The present invention relates to receptor tyrosine kinases with discoidin-type or like binding domains and nucleic acids encoding such receptor tyrosine kinases. It is noted that the “type” or “like” term that is associated with the names discoidin, discoidin I, and equivalent molecules in this disclosure implies that structural and functional similarities exist between the subject molecule's binding domain and the binding domain found in discoidin, discoidin I, and equivalent molecules.

[0004] 2. Description of the Background Art

[0005] Certain extracellular molecules influence cellular growth, differentiation or development via activation of receptor tyrosine kinases (Bishop, J. M. (1991) Cell 64, 235-248 and Hunter, T. (1991) Meth. Enz. 200, 3-37, which are incorporated herein by reference). Receptor tyrosine kinases are composed of an extracellular ligand binding domain and a cytoplasmic catalytic domain that allow for specific decoding of extracellular signals and initiation of intracellular biochemical effects. A large number of cDNAs for putative receptor tyrosine kinases have been cloned and sequenced (Hanks, S. K. & Quinn, A. M. (1991) Meth. Enz. 200, 38-62, which is incorporated herein by reference). With rare exception (Shier, P. & Watt, V. M. (1989) J. Biol. Chem. 264, 14605-14608, which is incorporated herein by reference), sequence similarities in extracellular domains have not suggested the type of ligand that might activate a given receptor. Nevertheless, the diversity of receptor tyrosine kinase extracellular domains implies that there are a large number of as yet unidentified extracellular molecules that can regulate receptor tyrosine kinase activity and, by extension, growth, differentiation or development. Identification of these ligands is crucial for a comprehensive understanding of these fundamental biological processes. It is clear that researchers are only beginning to understand the complexity of the extracellular signals that can activate receptor tyrosine kinases. Also, little is known about the fundamental process of information exchange across the membrane to initiate the intracellular cascade of biochemical effectors. Disclosed here is the cDNA cloning and characterization of a novel breast carcinoma cell protein with a primary structure which suggests that this protein is a receptor tyrosine kinase with an unusual mechanism of transmembrane signaling. An extracellular domain of this protein is similar to the lectin-like or type protein or specifically the carbohydrate-binding protein discoidin I.

SUMMARY OF THE INVENTION

[0006] An object of the present invention is to present a polypeptide having both a first domain with carbohydrate or discoidin I-type binding activity and a second domain with tyrosine kinase activity.

[0007] Another object of the present invention is to disclose a nucleic acid sequence that encodes for a polypeptide having a first domain with carbohydrate or discoidin I-type binding activity and a second domain with tyrosine kinase activity.

[0008] A further object of the present invention is to make known a marker that may be utilized to predict and diagnose tumors of the breast and lung.

[0009] Still another object of the present invention is to describe a nucleic acid sequence that encodes a polypeptide having both discoidin I-type ligand binding characteristics and tyrosine kinase activity that can be utilized in immunological and inhibitory regulation of associated tumors.

[0010] Yet a further object of the present invention is to disclose a polypeptide having both discoidin I-type ligand binding characteristics and tyrosine kinase activity that can be utilized in immunological and inhibitory regulation of associated tumors.

[0011] Disclosed is a novel breast carcinoma tyrosine phosphoprotein, DDR (Discoidin Domain Receptor), that defines a novel class of receptor tyrosine kinases. The DDR cDNA predicts a C-terminal tyrosine kinase domain and an N-terminal domain similar to the Dictyostelium discoideum lectin named discoidin or more specifically discoidin I. These domains are connected by an extraordinary hydrophilic proline/glycine-rich domain, which is interrupted by a predicted transmembrane sequence. This extended proline/glycine-rich region suggests an unusual geometry of interaction with ligand or substrates. Discoidin I-type domains are also found in other proteins, including coagulation factors V and VIII. Discoidin I-type domains may interact with specific cell surface molecules. SEQ ID NO: 1 designates the nucleotide sequence of the present invention, SEQ ID NO: 2 designates the polypepteide or protein sequence of the present invention, and SEQ ID NO: 3 designates the probe utilized in the subject invention.

[0012] Other objects, advantages, and novel features of the present invention will become apparent from the detailed description that follows, when considered in conjunction with the associated drawings.

DESCRIPTION OF THE FIGURES

[0013]FIGS. 1a-h (also, see SEQ ID NOS: 1 and 2 show the nucleotide and deduced amino acid sequence of the human DDR cDNA. Amino acids are numbered for the precursor protein. The single underlined amino acid sequence near the N-terminus contains the discoidin I-type domain and the single underlined amino acid sequence near the C-terminus contains the tyrosine kinase domain. The predicted signal peptide (1-18) and transmembrane (418-440) domains are in bold type. Uncertainty, as to the point of signal peptide cleavage, exists between amino acids 19 to 24. Potential N-glycosylation sites are bolded with underlining and cysteines within the extracellular region are marked with an asterisk. The proline and glycine residues between the discoidin I-type domain and the tyrosine kinase domain are italicized. The bolded with double underlining show the most proline/glycine-rich of the connecting region. The tyrosine within the Asn-Pro-Xaa-Tyr sequence found in the cytoplasmic juxtamembrane sequences of several plasma membrane receptors (Bansal, A. & Gierasch, L. M. (1991) Cell 67, 1195-201, which is incorporated herein by reference) is denoted with a double asterisk.

[0014]FIG. 2 is a schematic representation of DDR. DLD=discoidin I-type domain; TMD=transmembrane domain; TKD=tyrosine kinase domain. The kinked line represents the extent of the most proline/glycine-rich portion of the connecting region. The approximate boundaries of the peptide fused to b-galactosidase for rabbit immunization are indicated by #. The approximate positions of potential sites of N-glycosylation are denoted by asterisks.

[0015]FIG. 3 shows a Northern analysis of DDR transcripts in human cell line total RNAs. Total RNAs (20 micrograms) from human cell lines a BeWo, b T-47D, c MCF-7, d PANC-1, e A431, f U937, g Daudi, h HL-60, i Jurkat, j C32, k HepG2, l HeLa were fractionated on formaldehyde agarose gels, transferred to a nitrocellulose filter and hybridized with a 0.5 kilobase ³²P-labeled PCR product encompassing nucleotides 213 to 772 of the DDR cDNA in 50% formamide at 42° C. After washing and exposure to film for 3 days, the filter was subsequently probed with a ³²P-labeled mouse GAPDH (glyceraldehyde 3-phosphate dehydrogenase) cDNA to confirm that a similar amount of RNA was loaded in each lane. The migrations of the DDR and GAPDH transcripts as well as the ribosomal RNAs are indicated.

[0016]FIG. 4 shows a Northern analysis of DDR transcripts in mouse tissue RNAs. PolyA+ RNAs from adult mouse tissues: a brain, b thymus, c lung, d heart, e liver, f spleen, g small intestine, h kidney, i pancreas, j skeletal muscle, k testis, l ovary, m uterus, n placenta were fractionated on formaldehyde agarose gels, transferred to a nitrocellulose filter and hybridized with a 3.7 kilobase ³²P-labeled EcoR I fragment containing the DDR cDNA in 40% formamide at 42° C. After washing and exposure to film for 3 days, the filter was subsequently probed with a ³²P-labeled mouse GAPDH cDNA to control for the amount of RNA loaded in each lane. The mobilities of RNA markers is indicated in kilobases. The migrations of the DDR and GAPDH (glyceraldehyde 3-phosphate dehydrogenase) transcripts are indicated.

[0017]FIG. 5 shows DDR polypeptide and tyrosine phosphorylation in transfected COS-7 cells and T-47D breast carcinoma cells. Wheat germ agglutinin binding extracts of untransfected COS-7 cells, COS-7 cells transfected with the DDR cDNA and T-47D cells were fractionated on 7.5% NaDodSO₄-PAGE for immunoblot analysis with anti-DDR antiserum e.1 as described below in the Examples section. The filter was stripped of antibodies as described below in the Examples section, incubated with detection reagents to ensure stripping and then tyrosine phosphoproteins were detected with antibody PY20. In other experiments two tyrosine phosphoprotein bands were observed: one the size of DDR and the other at approximately 180 kDa. The mobilities of prestained protein markers are shown at the right.

DESCRIPTION OF THE PREFERRED EMBODIMENT

[0018] The DDR cDNA was isolated during a search for tyrosine kinase cDNAs related to those in the insulin receptor family. This cDNA predicts a protein of 914 amino acids (105 kDa) that has a signal peptide and a transmembrane domain (FIGS. 1 and 2). The predicted DDR protein also has the following features: a discoidin I-type domain near the N-terminus, an extensive proline/glycine-rich region between the discoidin I-type domain and the transmembrane domain, and another extensive proline/glycine-rich region between the transmembrane domain and the C-terminal tyrosine kinase domain. The sequence of the DDR catalytic domain places it within the insulin receptor family of receptor tyrosine kinases (Hanks, S. K. & Quinn, A. M. (1991) Meth. Enz. 200, 38-62, which is incorporated herein by reference). The catalytic domain is 45% identical with the trkA protein catalytic domain (Martin-Zanca, D., Oskam, R., Mitra, G., Copeland, T. & Barbacid, M. (1989) Mol. Cell. Biol. 9, 24-33, which is incorporated herein by reference) but the remainder of the molecule has no similarity to any other portion of the trk proteins. Also like trkA, the DDR protein has a relatively short C-terminal tail following the catalytic domain (8 amino acids in DDR versus 13 in trkA). The C-terminal tail of DDR does not contain tyrosine residues, but the tyrosine residues conserved within the catalytic domains of trkA and the insulin receptor are also conserved in DDR. This includes DDR tyrosines 793, 797, and 798, which by analogy with the insulin receptor are autophosphorylation sites. The exceptionally long cytoplasmic juxtamembrane region contains an Asn-Pro-Ala-Tyr sequence characteristic of the tight turn recognition motif for internalization in coated pits (Chen, W. J., Goldstein, J. L. & Brown, M. S. (1990) J Biol. Chem. 265, 3116-3123 and Bansal, A. & Gierasch, L. M. (1991) Cell 67, 1195-201, which are incorporated herein by reference).

[0019] Northern analysis of multiple human cell lines demonstrated that a 4.0 kilobase DDR transcript is relatively abundant in the human breast carcinoma cell lines T-47D, BT-20 and MCF-7 and also relatively high in the A431 epidermoid carcinoma cell line (FIG. 3). Transcripts hybridizing with the DDR cDNA were found in polyA+ RNAs from multiple mouse tissues, but in widely varying amounts (FIG. 4). Kidney, spleen and placenta had the highest levels of the 4.0 kilobase transcript relative to the levels of GAPDH (glyceraldehyde 3-phosphate dehydrogenase) mRNA in each preparation.

[0020] To characterize the DDR polypeptide, we transfected COS-7 cells with the DDR cDNA in a mammalian expression vector. Wheat germ agglutinin-binding extracts of transfected cells contained a 120 kDa DDR protein that was reactive with antisera developed against a lacZ fusion protein containing a portion of the extracellular domain of the DDR-encoded polypeptide (FIG. 5, anti-DDR). The DDR protein was also specifically reactive with an antiphosphotyrosine antibody presumably due to autophosphorylation (FIG. 5, anti-PY). The DDR protein was also detected in T47D and BT-20 breast carcinoma cell lines, but was not detected in a variety of other human cell lines (FIG. 5 and data not shown). The major tyrosine phosphoprotein present in wheat germ agglutinin-binding extracts of T-47D cells had an identical electrophoretic mobility to the DDR protein (FIG. 5).

[0021] We have characterized a novel putative receptor tyrosine kinase, DDR, that is abundant in breast carcinoma cell lines. DDR has at least two unusual features, the discoidin I-type domain and the extensive proline/glycine-rich regions, not present in other receptor tyrosine kinases. These features suggest that DDR may have an unusual mechanism of transmembrane signaling or an unusual ligand.

[0022] The presence of a discoidin I-type domain in the ectodomain of a receptor tyrosine kinase is provocative. Discoidin I is a Dictyostelium discoideum lectin that participates in cell aggregation (Springer, W. R., Cooper, D. N. W. & Barondes, S. H. (1984) Cell 39, 557-564, which is incorporated herein by reference). Discoidin I-type domains (Poole, S., Firtel, R. A., Lamar, E. & Rowenkamp, W. (1981) J. Mol. Biol. 153, 273-289, which is incorporated herein by reference) are present as tandem repeats at the C-terminus of the light chains of factor V (Kane, W. H. & Davie, E. W. (1986) Proc. Natl. Acad. Sci. U.S.A. 83, 6800-6804, which is incorporated herein by reference) factor VIII (Toole, J. J., Knopf, J. L., Wozney, J. M., Sultzman, L. A., Buecker, J. L., Pittman, D. D., Kaufman, R. J., Brown, E., Shoemaker, C., Orr, E. C., Amphlett, G. W., Foster, W. B., Coe, M. L., Knutson, G. J., Fass, D. N. & Hewick, R. M. (1984) Nature 312, 342-347 and Vehar, G. A., Keyt, B., Eaton, D., Rodriguez, H., O'Brien, D. P., Rotblat, F., Oppermann, H., Keck, R., Wood, W. I., Harkins, R. N., Tuddenham, E. G. D., Lawn, R. M. & Capon, D. J. (1984) Nature 312, 337-342, which are incorporated herein by reference) and two milk fat globule membrane proteins, MFG.E8 (Stubbs, J. D., Lekutis, C., Singer, K. L., Bui, A., Yuzuki, D., Srinivasan, U. & Parry, G. (1990) Proc. Natl. Acad. Sci. USA 87, 8417-8421, which is incorporated herein by reference) and BA46 (Larocca, D., Peterson, J. A., Urrea, R., Kuniyoshi, J., Bistrain, A. M. & Ceriani, R. L. (1991) Cancer Research 51, 4994-4998, which is incorporated herein by reference). The role of the discoidin I-type domains in these proteins is not completely understood, but there is evidence suggesting that the light chains of factor V and VIII interact with specific platelet membrane proteins (Tracey, P. B., Peterson, J. M., Nesheim, M. E., McDuffie, F. C. & Mann, K. G. (1979) J. Biol. Chem. 254, 10354-10361, Tracey, P. B., Nesheim, M. E. & Mann, K. G. (1980) J. Biol. Chem. 255, 662-669, and Nesheim, M., Pittman, D. D., Wang, J. H., Slonosky, D., Giles, A. R. & Kaufman, R. J. (1988) J. Biol. Chem. 263, 16467-16470, which are incorporated herein by reference), and MFG.E8 and BA46 are stably associated with mammary epithelial membranes (Stubbs, J. D., Lekutis, C., Singer, K. L., Bui, A., Yuzuki, D., Srinivasan, U. & Parry, G. (1990) Proc. Natl. Acad. Sci. USA 87, 8417-8421 and Larocca, D., Peterson, J. A., Urrea, R., Kuniyoshi, J., Bistrain, A. M. & Ceriani, R. L. (1991) Cancer Research 51, 4994-4998, which are incorporated herein by reference). Recently, tandem discoidin I-type domains have also been found in the extracellular region of a cell surface transmembrane protein, A5 (Takagi, S., Hirata, T., Agata, K., Mochii, M., Eguchi, G. & Fujisawa, H. (1991) Neuron 7, 295-307, which is incorporated herein by reference), which has a small (44 amino acids) cytoplasmic region. Since this protein has a highly specific localization at the termination site of the optic nerve and is expressed contemporaneously with optic nerve innervation, it has been proposed that A5 is a targeting molecule for retinal axons (Takagi, S., Tsuji, T., Amagai, T. & Fujisawa, H. (1987) Dev. Biol. 122, 90-100, which is incorporated herein by reference). A highly conserved consensus sequence can be derived from an alignment of the discoidin I-type domains in these proteins (Poole, S., Firtel, R. A., Lamar, E. & Rowenkamp, W. (1981) J. Mol. Biol. 153, 273-289, Larocca, D., Peterson, J. A., Urrea, R., Kuniyoshi, J., Bistrain, A. M. & Ceriani, R. L. (1991) Cancer Research 51, 4994-4998, Stubbs, J. D., Lekutis, C., Singer, K. L., Bui, A., Yuzuki, D., Srinivasan, U. & Parry, G. (1990) Proc. Natl. Acad. Sci. USA 87, 8417-8421, Kane, W. H. & Davie, E. W. (1986) Proc. Natl. Acad. Sci. U.S.A. 83, 6800-6804, Vehar, G. A., Keyt, B., Eaton, D., Rodriguez, H., O'Brien, D. P., Rotblat, F., Oppermann, H., Keck, R., Wood, W. I., Harkins, R. N., Tuddenham, E. G. D., Lawn, R. M. & Capon, D. J. (1984) Nature 312, 337-342, and Takagi, S., Hirata, T., Agata, K., Mochii, M., Eguchi, G. & Fujisawa, H. (1991) Neuron 7, 295-307). The mammalian members of the group also have similarities outside the region of similarity with discoidin I. The DDR discoidin I-type domain adheres closely to a consensus sequence defined by the other discoidin domain containing proteins. Analogy with the other discoidin I-type domain containing proteins suggests that the DDR ligand may be a cell surface molecule. Since DDR is relatively abundant in breast carcinoma cells, it is particularly intriguing that mammary epithelial membranes appear to express a component that binds BA46 (Larocca, D., Peterson, J. A., Urrea, R., Kuniyoshi, J., Bistrain, A. M. & Ceriani, R. L. (1991) Cancer Research 51, 4994-4998, which is incorporated herein by reference). Characterization of the DDR ligand could reveal a set of interactions involved in multiple biological regulatory systems.

[0023] The two juxtamembrane regions of DDR suggest a unique signaling mechanism for DDR. Out of 180 residues in the regions indicated in FIGS. 1 and 2, 46 are proline and 19 are glycine (36% P+G). The region between the discoidin I-type domain and this extremely proline/glycine-rich region is also glycine-rich; therefore the 250 amino acid connecting region between the discoidin I-type domain and the tyrosine kinase domain, excluding the transmembrane and stop-transfer regions, is 25% proline/glycine. The extensive hydrophilic proline/glycine-rich region of DDR does not contain collagen-like repeating motifs, nor does it contain sequence patterns characteristic of the family of salivary proline-rich proteins. The 176 residue cytoplasmic juxtamembrane region is the longest described for any receptor tyrosine kinase (over three times longer than the average juxtamembrane span in the insulin receptor family of tyrosine kinases). Proline/glycine-rich regions like the one found in DDR are potentially flexible, but the physical properties of this region remain to be experimentally determined. A proline/glycine-rich region of the type found in DDR has not previously been observed in an integral membrane protein. However, regions of similar composition and length are found within the adaptins, which tether transmembrane proteins (e.g. receptors) to clathrin-coated pits and vesicles (Robinson, M. S. (1989) J. Cell Biol. 108, 833-842, Ponnambalam, S. Robinson, M. S., Jackson, A. P., Peiper, L. & Parham, P. (1990) J. Biol. Chem. 265, 4814-4820, Kirchhausen, T., Nathanson, K. L., Matsui, W., Vaisberg, A., Chow, E. P., Burne, C., Keen, J. H. & Davis, A. E. (1989) Proc. Natl. Acad. Sci. USA 86, 2612-2616, Robinson, M. S. (1990) J. Cell Biol. 111, 2319-2326, Virshup, D. M. & Bennett, V. (1988) J. Cell Biol. 106, 39-50, and Heuser, J. E. & Keen, J. (1988) J. Cell Biol. 107, 877-886, which are incorporated herein by reference). The corresponding regions of a (Robinson, M. S. (1989) J. Cell Biol. 108, 833-842), b (Ponnambalam, S. Robinson, M. S., Jackson, A. P., Peiper, L. & Parham, P. (1990) J. Biol. Chem. 265, 4814-4820 and Kirchhausen, T., Nathanson, K. L., Matsui, W., Vaisberg, A., Chow, E. P., Burne, C., Keen, J. H. & Davis, A. E. (1989) Proc. Natl. Acad. Sci. USA 86, 2612-2616, which are incorporated by reference) and d-adaptins (Robinson, M. S. (1990) J. Cell Biol. 111, 2319-2326, which is incorporated herein by reference) are 34%, 26% and 27% proline+glycine over 96, 152, and 125 residues, respectively. Electron microscopic examination of adaptins demonstrates two globular domains connected by an extended hinge (Virshup, D. M. & Bennett, V. (1988) J. Cell Biol. 106, 39-50 and Heuser, J. E. & Keen, J. (1988) J. Cell Biol. 107, 877-886, which are incorporated herein by reference). The DDR proline/glycine-rich region does not contain sequence patterns similar to the adaptin hinge regions. The a and b adaptins, however, also do not have sequence motifs conserved between their respective hinge regions; it appears that overall amino acid composition determines hinge formation rather than a specific sequence pattern (Takagi, S., Tsuji, T., Amagai, T. & Fujisawa, H. (1987) Dev. Biol. 122, 90-100, which is incorporated herein by reference). The adaptin hinges appear to be required to bridge the physical gap between clathrin and plasma membrane proteins in coated pits or vesicles (Ponnambalam, S. Robinson, M. S., Jackson, A. P., Peiper, L. & Parham, P. (1990) J. Biol. Chem. 265, 4814-4820, Kirchhausen, T., Nathanson, K. L., Matsui, W., Vaisberg, A., Chow, E. P., Burne, C., Keen, J. H. & Davis, A. E. (1989) Proc. Natl. Acad. Sci. USA 86, 2612-2616, Robinson, M. S. (1990) J. Cell Biol. 111, 2319-2326, Virshup, D. M. & Bennett, V. (1988) J. Cell Biol. 106, 39-50, and Heuser, J. E. & Keen, J. (1988) J. Cell Biol. 107, 877-886, which are incorporated herein by reference). The proline/glycine-rich regions of the DDR may serve an analogous tethering function and allow a unique geometry in interaction with the ligand or allow access of the tyrosine kinase domain to a unique set of substrates.

[0024] Elucidation of the physical characteristics and ligand binding properties of DDR should provide insight into a unique transmembrane signaling process.

[0025] Several important uses of the subject invention are disclosed. First, the DDR cDNA or antibody reagents generated from the sequence may be useful for diagnostic or prognostic analysis of tumors of the breast and lung. Antibodies to the extracellular domain may be useful for screening blood or other tissue samples.

[0026] Second, the recombinant DDR extracellular domain should allow purification and characterization of the ligand. Identification of the ligand is important for designing therapeutic agents which might act via the DDR to influence the behavior of breast and lung tumors.

[0027] Third, the recombinant DDR extracellular domain is a potential immunogen for active immunotherapy of breast and lung cancer.

[0028] Fourth, the recombinant DDR extracellular domain may block the normal function of the receptor in vivo by occupying the ligand. This could inhibit the growth of tumors that require growth signals from activated DDR.

[0029] Clarifying and expanding on the above possible uses, the primary sequence of the DDR cDNA predicts a transmembrane protein that contains an N-terminal discoidin I-like domain, a membrane spanning region and a C-terminal tyrosine kinase domain. The presence of a tyrosine kinase domain suggests that the DDR protein is involved in transmembrane control of the growth status of the cell. The presence of an extracellular discoidin I-type domain, which is unique to DDR among receptor tyrosine kinases, suggests that DDR interacts with a ligand found on cell surfaces or within the extracellular matrix. Since discoidin I is a lectin, it is possible that the DDR ligand will have a carbohydrate component.

[0030] The DDR cDNA has also provided the means to define the nature of the ligand. Recombinant forms of the subject DDR cDNA have been used to obtain protein reagents corresponding to fragments or rearrangements of the predicted protein. The N-terminal portion of the DDR cDNA has been truncated to encode an extracellular domain fragment corresponding to amino acids 1-387 of the predicted protein sequence, which includes the signal peptide. the DLD and a portion of the juxtamembrane region, to which specific antiserum has been obtained. Identification of the ligand involves development of binding assays to determine the localization of the ligand within tissues and to define the molecular determinants of DDR ligand-receptor interactions. This type of binding assay will permit determining the exact carbohydrate binding determinants.

[0031] It is clear that DDR itself is present in several different types of human tumor cell lines that originated in specific types of epithelia. If the DDR ligand is present within such tumors, then blocking its interaction with DDR may influence the survival of the tumor. Soluble DDR derived reagents may be able to directly block the DDR-ligand interactions and could be used to screen compounds for the ability to block the receptor-ligand interaction. For example, if there is a carbohydrate component of interaction of the ligand with DDR, specific carbohydrates may be able to block growth stimuli normally or abnormally mediated by the DDR polypeptide.

[0032] A portion of the DDR cDNA has been used to aid in generating a polyclonal antibody specific for the extracellular domain of the DDR polypeptide. This and other antibodies which are generated as a result of the DDR cDNA sequences could have prognostic value in the analysis of specific tumors of epithelial origin. Since the extracellular domain of receptors are often shed from the cell surface and can be subsequently found in serum samples, antibodies such as the one generated here could be useful for screening blood samples for the presence of the DDR extracellular domain. If the DDR molecule is found to have prognostic value for specific tumors, the ability assay for its presence in blood samples could have value for non-invasive screening and diagnosis.

[0033] The DNA sequence disclosed herein has been assigned Genbank accession number L11315.

EXAMPLES Example 1 cDNA Cloning and Characterization

[0034] A full term human placental Igt10 cDNA library was screened with a ³²P-labeled antisense oligonucleotide of the sequence 5′-GTT(G/C)CG(A/G)GC(A/G)GCCAG(A/G)TC-(G/C)CG(A/G)TG-3′ (SEQ ID NO: 3), corresponding to the His-Arg-Asp-Leu-Ala-Ala-Arg-Asn amino acid sequence found in many tyrosine kinases, by methods described previously (Fischman, K., Edman, J. C., Shacklefor, G. M., Turner, J. A., Rutter, W. J. & Nir, U. (1990) Mol. Cell. Biol. 10, 146-153, which is incorporated herein by reference). Positive clones were grouped by cross-hybridization and plaque isolated for subcloning in M13 derivatives or plasmids for sequencing with Sequenase (USB) and an ABI 370A automated sequencer. Three individual clones from a class of 19 had identical nucleotide sequence tags and overlapping restriction maps. Both strands of the DDR cDNA in one of these were sequenced in their entirety. This sequence has one long open reading frame (nucleotides 88 to 2883) which is followed by a polyadenylation signal (at nucleotide 3721) and a polyadenine tract (SEQ ID NO: 1). The first in-frame methionine (at nucleotide 142) has a Kozak consensus sequence (Kozak, M. (1987) Nucleic Acids Res. 15, 8125-8148, which is incorporated herein by reference) followed by a predicted signal peptide that initiates a precursor protein of 914 amino acids (SEQ ID NO: 2). A second hydrophobic region (residues 419-437) conforms to expectations for a membrane-spanning domain.

Example 2 Northern Analysis

[0035] RNAs were prepared using RNAzol (CinnaBiotecx) reagent and protocols. Formaldehyde-agarose fractionated RNAs were transferred to nitrocellulose and hybridized as described previously (Harlow, E. & Lane, D. (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., which is incorporated herein by reference) with ³²P-labeled DDR cDNA fragments described in the figure descriptions. Filters were subsequently probed with a ³²P-labeled GAPDH (glyceraldehyde 3-phosphate dehydrogenase) cDNA.

Example 3 Production of Anti-DDR Antisera E.1

[0036] A Sal I-Eco RI fragment of the DDR cDNA was subcloned to pUR290 to allow production of a b-galactosidase fusion (Harlow, E. & Lane, D. (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., which is incorporated herein by reference) of DDR residues 223 to 346 within the extracellular region of the protein. The fusion protein was purified by NaDodSO₄-PAGE for rabbit immunization (Harlow, E. & Lane, D. (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., which is incorporated herein by reference).

Example 4 Transient Expression of Human DDR in COS-7 Cells

[0037] The DDR cDNA was subcloned into the Eco RI site of the mammalian expression vector pECE (Ellis, L., Clauser, E., Morgan, D. O., Edery, M., Roth, R. A. & Rutter, W. J. (1986) Cell 45, 721-732, which is incorporated herein by reference). This plasmid, pSV33, was used to transfect COS-7 cells by the DEAE-dextran method (Kaufman, R. J. (1990) Meth. Enz. 185, 487-511, which is incorporated herein by reference). After 48 hours the cells were solubilized in NaDodSO₄-PAGE sample buffer for immunoblotting.

Example 5 Immunoblotting

[0038] Cells on 10 cm plates were solubilized in 1.0 ml of RIPA buffer containing 1% Triton X-100, 20 mM Tris-HCl pH 7.5, 50 mM sodium chloride, 1 mM sodium orthovanadate, 1 mM PMSF, 50 mM sodium fluoride, 5 mM EDTA and 20 mM sodium pyrophosphate at 4° C. Samples were centrifuged for 15 minutes at 10,000×g at 4° C. and the supernatants were rocked with 50 ml of wheat germ agglutinin agarose (Vector) for 1.5 h. The beads were washed three times with RIPA buffer and boiled in 100 ml of NaDodSO₄-PAGE sample buffer. Samples were fractionated on 7.5% NaDodSO₄-PAGE, transferred to Immobilon-P (Millipore) and incubated with a 1:500 dilution of rabbit antiserum e.1 (see above) in 5% nonfat dry milk in phosphate buffered saline and subsequently incubated with horseradish peroxidase (HRP)-coupled sheep anti-rabbit IgG for detection with ECL (Amersham) reagents. The filters were stripped of antibodies with 100 mM 2-mercaptoethanol, 2% NaDodSO₄, 62.5 mM Tris-HCl pH 6.7 for 30 minutes at 50° C. The filters were washed and incubated with HRP-sheep anti-rabbit IgG and ECL reagents as before to ensure that all antibodies were removed. The filters were then incubated with a 1:1000 dilution of the monoclonal antiphosphotyrosine antibody PY20 (ICN), and subsequently with ¹²⁵I-protein A (ICN) (Harlow, E. & Lane, D. (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., which is incorporated herein by reference) prior to autoradiography.

[0039] The invention has now been explained with reference to specific embodiments. Other embodiments will be suggested to those of ordinary skill in the appropriate art upon review of the present specification.

[0040] Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be practiced within the scope of the appended claims.

1 6 1 3754 DNA Homo sapiens CDS (142)..(2883) discoidin domain receptor (DDR) tyrosine kinase 1 cccgggtcgg accgcctggg tctgccggga agagcgatga gaggtgtctg aaggtggcta 60 ttcactgagc gatggggttg gacttgaagg aatgccaaga gatgctgccc ccaccccctt 120 aggcccgagg gatcaggagc t atg gga cca gag gcc ctg tca tct tta ctg 171 Met Gly Pro Glu Ala Leu Ser Ser Leu Leu 1 5 10 ctg ctg ctc ttg gtg gca agt gga gat gct gac atg aag gga cat ttt 219 Leu Leu Leu Leu Val Ala Ser Gly Asp Ala Asp Met Lys Gly His Phe 15 20 25 gat cct gcc aag tgc cgc tat gcc ctg ggc atg cag gac cgg acc atc 267 Asp Pro Ala Lys Cys Arg Tyr Ala Leu Gly Met Gln Asp Arg Thr Ile 30 35 40 cca gac agt gac atc tct gct tcc agc tcc tgg tca gat tcc act gcc 315 Pro Asp Ser Asp Ile Ser Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala 45 50 55 gcc cgc cac agc agg ttg gag agc agt gac ggg gat ggg gcc tgg tgc 363 Ala Arg His Ser Arg Leu Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys 60 65 70 ccc gca ggg tcg gtg ttt ccc aag gag gag gag tac ttg cag gtg gat 411 Pro Ala Gly Ser Val Phe Pro Lys Glu Glu Glu Tyr Leu Gln Val Asp 75 80 85 90 cta caa cga ctc cac ctg gtg gct ctg gtg ggc acc cag gga cgg cat 459 Leu Gln Arg Leu His Leu Val Ala Leu Val Gly Thr Gln Gly Arg His 95 100 105 gcc ggg ggc ctg ggc aag gag ttc tcc cgg agc tac cgg ctg cgt tac 507 Ala Gly Gly Leu Gly Lys Glu Phe Ser Arg Ser Tyr Arg Leu Arg Tyr 110 115 120 tcc cgg gat ggt cgc cgc tgg atg ggc tgg aag gac cgc tgg ggt cag 555 Ser Arg Asp Gly Arg Arg Trp Met Gly Trp Lys Asp Arg Trp Gly Gln 125 130 135 gag gtg atc tca ggc aat gag gac cct gag gga gtg gtg ctg aag gac 603 Glu Val Ile Ser Gly Asn Glu Asp Pro Glu Gly Val Val Leu Lys Asp 140 145 150 ctt ggg ccc ccc atg gtt gcc cga ctg gtt cgc ttc tac ccc cgg gct 651 Leu Gly Pro Pro Met Val Ala Arg Leu Val Arg Phe Tyr Pro Arg Ala 155 160 165 170 gac cgg gtc atg agc gtc tgt ctg cgg gta gag ctc tat ggc tgc ctc 699 Asp Arg Val Met Ser Val Cys Leu Arg Val Glu Leu Tyr Gly Cys Leu 175 180 185 tgg agg gat gga ctc ctg tct tac acc gcc cct gtg ggg cag aca atg 747 Trp Arg Asp Gly Leu Leu Ser Tyr Thr Ala Pro Val Gly Gln Thr Met 190 195 200 tat tta tct gag gcc gtg tac ctc aac gac tcc acc tat gac gga cat 795 Tyr Leu Ser Glu Ala Val Tyr Leu Asn Asp Ser Thr Tyr Asp Gly His 205 210 215 acc gtg ggc gga ctg cag tat ggg ggt ctg ggc cag ctg gca gat ggt 843 Thr Val Gly Gly Leu Gln Tyr Gly Gly Leu Gly Gln Leu Ala Asp Gly 220 225 230 gtg gtg ggg ctg gat gac ttt agg aag agt cag gag ctg cgg gtc tgg 891 Val Val Gly Leu Asp Asp Phe Arg Lys Ser Gln Glu Leu Arg Val Trp 235 240 245 250 cca ggc tat gac tat gtg gga tgg agc aac cac agc ttc tcc agt ggc 939 Pro Gly Tyr Asp Tyr Val Gly Trp Ser Asn His Ser Phe Ser Ser Gly 255 260 265 tat gtg gag atg gag ttt gag ttt gac cgg ctg agg gcc ttc cag gct 987 Tyr Val Glu Met Glu Phe Glu Phe Asp Arg Leu Arg Ala Phe Gln Ala 270 275 280 atg cag gtc cac tgt aac aac atg cac acg ctg gga gcc cgt ctg cct 1035 Met Gln Val His Cys Asn Asn Met His Thr Leu Gly Ala Arg Leu Pro 285 290 295 ggc ggg gtg gaa tgt cgc ttc cgg cgt ggc cct gcc atg gcc tgg gag 1083 Gly Gly Val Glu Cys Arg Phe Arg Arg Gly Pro Ala Met Ala Trp Glu 300 305 310 ggg gag ccc atg cgc cac aac cta ggg ggc aac ctg ggg gac ccc aga 1131 Gly Glu Pro Met Arg His Asn Leu Gly Gly Asn Leu Gly Asp Pro Arg 315 320 325 330 gcc cgg gct gtc tca gtg ccc ctt ggc ggc cgt gtg gct cgc ttt ctg 1179 Ala Arg Ala Val Ser Val Pro Leu Gly Gly Arg Val Ala Arg Phe Leu 335 340 345 cag tgc cgc ttc ctc ttt gcg ggg ccc tgg tta ctc ttc agc gaa atc 1227 Gln Cys Arg Phe Leu Phe Ala Gly Pro Trp Leu Leu Phe Ser Glu Ile 350 355 360 tcc ttc atc tct gat gtg gtg aac aat tcc tct ccg gca ctg gga ggc 1275 Ser Phe Ile Ser Asp Val Val Asn Asn Ser Ser Pro Ala Leu Gly Gly 365 370 375 acc ttc ccg cca gcc ccc tgg tgg ccg cct ggc cca cct ccc acc aac 1323 Thr Phe Pro Pro Ala Pro Trp Trp Pro Pro Gly Pro Pro Pro Thr Asn 380 385 390 ttc agc agc ttg gag ctg gag ccc aga ggc cag cca agg ccc gtg gcc 1371 Phe Ser Ser Leu Glu Leu Glu Pro Arg Gly Gln Pro Arg Pro Val Ala 395 400 405 410 aag gcc gag ggg agc ccg acc gcc atc ctc atc ggc tgc ctg gtg gcc 1419 Lys Ala Glu Gly Ser Pro Thr Ala Ile Leu Ile Gly Cys Leu Val Ala 415 420 425 atc atc ctg ctc ctg ctg ctc atc att gcc ctc atg ctc tgg cgg ctg 1467 Ile Ile Leu Leu Leu Leu Leu Ile Ile Ala Leu Met Leu Trp Arg Leu 430 435 440 cac tgg cgc agg ctc ctc agc aag gct gaa cgg agg gtg ttg gaa gag 1515 His Trp Arg Arg Leu Leu Ser Lys Ala Glu Arg Arg Val Leu Glu Glu 445 450 455 gag ctg acg gtt cac ctc tct gtc cct ggg gac act atc ctc atc aac 1563 Glu Leu Thr Val His Leu Ser Val Pro Gly Asp Thr Ile Leu Ile Asn 460 465 470 aac cgc cca ggt cct aga gag cca ccc ccg tac cag gag ccc cgg cct 1611 Asn Arg Pro Gly Pro Arg Glu Pro Pro Pro Tyr Gln Glu Pro Arg Pro 475 480 485 490 cgt ggg aat ccg ccc cac tcc gct ccc tgt gtc ccc aat ggc tct gcg 1659 Arg Gly Asn Pro Pro His Ser Ala Pro Cys Val Pro Asn Gly Ser Ala 495 500 505 ttg ctg ctc tcc aat cca gcc tac cgc ctc ctt ctg gcc act tac gcc 1707 Leu Leu Leu Ser Asn Pro Ala Tyr Arg Leu Leu Leu Ala Thr Tyr Ala 510 515 520 cgt ccc cct cga ggc ccg ggc ccc ccc aca ccc gcc tgg gcc aaa ccc 1755 Arg Pro Pro Arg Gly Pro Gly Pro Pro Thr Pro Ala Trp Ala Lys Pro 525 530 535 acc aac acc cag gcc tac agt ggg gac tat atg gag cct gag aag cca 1803 Thr Asn Thr Gln Ala Tyr Ser Gly Asp Tyr Met Glu Pro Glu Lys Pro 540 545 550 ggc gcc ccg ctt ctg ccc cca cct ccc cag aac agc gtc ccc cat tat 1851 Gly Ala Pro Leu Leu Pro Pro Pro Pro Gln Asn Ser Val Pro His Tyr 555 560 565 570 gcc gag gct gac att gtt acc ctg cag ggc gtc acc ggg ggc aac acc 1899 Ala Glu Ala Asp Ile Val Thr Leu Gln Gly Val Thr Gly Gly Asn Thr 575 580 585 tat gct gtg cct gca cct ccc cca ggg gca gtc ggg gat ggg ccc ccc 1947 Tyr Ala Val Pro Ala Pro Pro Pro Gly Ala Val Gly Asp Gly Pro Pro 590 595 600 aga gtg gat ttc cct cga tct cga ctc cgc ttc aag gag aag ctt ggc 1995 Arg Val Asp Phe Pro Arg Ser Arg Leu Arg Phe Lys Glu Lys Leu Gly 605 610 615 gag ggc cag ttt ggg gag gtg cac ctg tgt gag gtc gac agc cct caa 2043 Glu Gly Gln Phe Gly Glu Val His Leu Cys Glu Val Asp Ser Pro Gln 620 625 630 gat ctg gtt agt ctt gat ttc ccc ctt aat gtg cgt aag gga cac cct 2091 Asp Leu Val Ser Leu Asp Phe Pro Leu Asn Val Arg Lys Gly His Pro 635 640 645 650 ttg ctg gta gct gtc aag atc tta cgg cca gat gcc acc aag aat gcc 2139 Leu Leu Val Ala Val Lys Ile Leu Arg Pro Asp Ala Thr Lys Asn Ala 655 660 665 agg aat gat ttc ctg aaa gag gtg aag atc atg tcg agg ctc aag gac 2187 Arg Asn Asp Phe Leu Lys Glu Val Lys Ile Met Ser Arg Leu Lys Asp 670 675 680 cca aac atc att cgg ctg ctg ggc gtg tgt gtg cag gac gac ccc ctc 2235 Pro Asn Ile Ile Arg Leu Leu Gly Val Cys Val Gln Asp Asp Pro Leu 685 690 695 tgc atg att act gac tac atg gag aac ggc gac ctc aac cag ttc ctc 2283 Cys Met Ile Thr Asp Tyr Met Glu Asn Gly Asp Leu Asn Gln Phe Leu 700 705 710 agt gcc cac cag ctg gag gac aag gca gcc gag ggg gcc cct ggg gac 2331 Ser Ala His Gln Leu Glu Asp Lys Ala Ala Glu Gly Ala Pro Gly Asp 715 720 725 730 ggg cag gct gcg cag ggg ccc acc atc agc tac cca atg ctg ctg cat 2379 Gly Gln Ala Ala Gln Gly Pro Thr Ile Ser Tyr Pro Met Leu Leu His 735 740 745 gtg gca gcc cag atc gcc tcc ggc atg cgc tat ctg gcc aca ctc aac 2427 Val Ala Ala Gln Ile Ala Ser Gly Met Arg Tyr Leu Ala Thr Leu Asn 750 755 760 ttt gta cat cgg gac ctg gcc acg cgg aac tgc cta gtt ggg gaa aat 2475 Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Asn 765 770 775 ttc acc atc aaa atc gca gac ttt ggc atg agc cgg aac ctc tat gct 2523 Phe Thr Ile Lys Ile Ala Asp Phe Gly Met Ser Arg Asn Leu Tyr Ala 780 785 790 ggg gac tat tac cgt gtg cag ggc cgg gca gtg ctg ccc atc cgc tgg 2571 Gly Asp Tyr Tyr Arg Val Gln Gly Arg Ala Val Leu Pro Ile Arg Trp 795 800 805 810 atg gcc tgg gag tgc atc ctc atg ggg aag ttc acg act gcg agt gac 2619 Met Ala Trp Glu Cys Ile Leu Met Gly Lys Phe Thr Thr Ala Ser Asp 815 820 825 gtg tgg gcc ttt ggt gtg acc ctg tgg gag gtg ctg atg ctc tgt agg 2667 Val Trp Ala Phe Gly Val Thr Leu Trp Glu Val Leu Met Leu Cys Arg 830 835 840 gcc cag ccc ttt ggg cag ctc acc gac gag cag gtc atc gag aac gcg 2715 Ala Gln Pro Phe Gly Gln Leu Thr Asp Glu Gln Val Ile Glu Asn Ala 845 850 855 ggg gag ttc ttc cgg gac cag ggc cgg cag gtg tac ctg tcc cgg ccg 2763 Gly Glu Phe Phe Arg Asp Gln Gly Arg Gln Val Tyr Leu Ser Arg Pro 860 865 870 cct gcc tgc ccg cag ggc cta tat gag ctg atg ctt cgg tgc tgg agc 2811 Pro Ala Cys Pro Gln Gly Leu Tyr Glu Leu Met Leu Arg Cys Trp Ser 875 880 885 890 cgg gag tct gag cag cga cca ccc ttt tcc cag ctg cat cgg ttc ctg 2859 Arg Glu Ser Glu Gln Arg Pro Pro Phe Ser Gln Leu His Arg Phe Leu 895 900 905 gca gag gat gca ctc aac acg gtg tgaatcacac atccagctgc ccctccctca 2913 Ala Glu Asp Ala Leu Asn Thr Val 910 gggagcgatc caggggaagc cagtgacact aaaacaagag gacacaatgg cacctctgcc 2973 ccttcccctc ccgacagccc atcacctcta atagaggcag tgagactgca ggctgggccc 3033 acccagggag ctgatgcccc ttctcccctt cctggacaca ctctcatgtc cccttcctgt 3093 tcttccttcc tagaagcccc tgtcgcccac ccagctggtc ctgtggatgg gatcctctcc 3153 acccacctct agccatccct tggggaaggg tggggagaaa tataggatag acactggaca 3213 tggcccattg gagcacctgg gccccactgg acaacactga ttcctggaca ggtggctgcg 3273 cccccagctt ctctctccct gtcacacact ggaccccact ggctgagaat ctgggggtga 3333 ggaggacaag aaggagagga aaatgtttcc ttgtgcctgc tcctgtactt gtcctcagct 3393 tgggcttctt cctcctccat cacctgaaac actggacctg ggggtagccc cgccccagcc 3453 ctcagtcacc ccccacttcc cacctgcagt cttgtagcta gaacttctct aagcctatac 3513 gtttctgtgg agtaaatatt gggattgggg ggaaagaggg agcaacggcc catagccttg 3573 gggttggaca tctctagtgt agctgccaca ttgatttttc tataatcact tgggtttgta 3633 catttttggg gggagagaca cagattttta cactaatata tggacctagc ttgaggcaat 3693 tttaatcccc tgcactaggc aggtaataat aaaggttgag ttttccacaa aaaaaaaaaa 3753 a 3754 2 914 PRT Homo sapiens PEPTIDE (1)..(18) signal peptide 2 Met Gly Pro Glu Ala Leu Ser Ser Leu Leu Leu Leu Leu Leu Val Ala 1 5 10 15 Ser Gly Asp Ala Asp Met Lys Gly His Phe Asp Pro Ala Lys Cys Arg 20 25 30 Tyr Ala Leu Gly Met Gln Asp Arg Thr Ile Pro Asp Ser Asp Ile Ser 35 40 45 Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala Ala Arg His Ser Arg Leu 50 55 60 Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys Pro Ala Gly Ser Val Phe 65 70 75 80 Pro Lys Glu Glu Glu Tyr Leu Gln Val Asp Leu Gln Arg Leu His Leu 85 90 95 Val Ala Leu Val Gly Thr Gln Gly Arg His Ala Gly Gly Leu Gly Lys 100 105 110 Glu Phe Ser Arg Ser Tyr Arg Leu Arg Tyr Ser Arg Asp Gly Arg Arg 115 120 125 Trp Met Gly Trp Lys Asp Arg Trp Gly Gln Glu Val Ile Ser Gly Asn 130 135 140 Glu Asp Pro Glu Gly Val Val Leu Lys Asp Leu Gly Pro Pro Met Val 145 150 155 160 Ala Arg Leu Val Arg Phe Tyr Pro Arg Ala Asp Arg Val Met Ser Val 165 170 175 Cys Leu Arg Val Glu Leu Tyr Gly Cys Leu Trp Arg Asp Gly Leu Leu 180 185 190 Ser Tyr Thr Ala Pro Val Gly Gln Thr Met Tyr Leu Ser Glu Ala Val 195 200 205 Tyr Leu Asn Asp Ser Thr Tyr Asp Gly His Thr Val Gly Gly Leu Gln 210 215 220 Tyr Gly Gly Leu Gly Gln Leu Ala Asp Gly Val Val Gly Leu Asp Asp 225 230 235 240 Phe Arg Lys Ser Gln Glu Leu Arg Val Trp Pro Gly Tyr Asp Tyr Val 245 250 255 Gly Trp Ser Asn His Ser Phe Ser Ser Gly Tyr Val Glu Met Glu Phe 260 265 270 Glu Phe Asp Arg Leu Arg Ala Phe Gln Ala Met Gln Val His Cys Asn 275 280 285 Asn Met His Thr Leu Gly Ala Arg Leu Pro Gly Gly Val Glu Cys Arg 290 295 300 Phe Arg Arg Gly Pro Ala Met Ala Trp Glu Gly Glu Pro Met Arg His 305 310 315 320 Asn Leu Gly Gly Asn Leu Gly Asp Pro Arg Ala Arg Ala Val Ser Val 325 330 335 Pro Leu Gly Gly Arg Val Ala Arg Phe Leu Gln Cys Arg Phe Leu Phe 340 345 350 Ala Gly Pro Trp Leu Leu Phe Ser Glu Ile Ser Phe Ile Ser Asp Val 355 360 365 Val Asn Asn Ser Ser Pro Ala Leu Gly Gly Thr Phe Pro Pro Ala Pro 370 375 380 Trp Trp Pro Pro Gly Pro Pro Pro Thr Asn Phe Ser Ser Leu Glu Leu 385 390 395 400 Glu Pro Arg Gly Gln Pro Arg Pro Val Ala Lys Ala Glu Gly Ser Pro 405 410 415 Thr Ala Ile Leu Ile Gly Cys Leu Val Ala Ile Ile Leu Leu Leu Leu 420 425 430 Leu Ile Ile Ala Leu Met Leu Trp Arg Leu His Trp Arg Arg Leu Leu 435 440 445 Ser Lys Ala Glu Arg Arg Val Leu Glu Glu Glu Leu Thr Val His Leu 450 455 460 Ser Val Pro Gly Asp Thr Ile Leu Ile Asn Asn Arg Pro Gly Pro Arg 465 470 475 480 Glu Pro Pro Pro Tyr Gln Glu Pro Arg Pro Arg Gly Asn Pro Pro His 485 490 495 Ser Ala Pro Cys Val Pro Asn Gly Ser Ala Leu Leu Leu Ser Asn Pro 500 505 510 Ala Tyr Arg Leu Leu Leu Ala Thr Tyr Ala Arg Pro Pro Arg Gly Pro 515 520 525 Gly Pro Pro Thr Pro Ala Trp Ala Lys Pro Thr Asn Thr Gln Ala Tyr 530 535 540 Ser Gly Asp Tyr Met Glu Pro Glu Lys Pro Gly Ala Pro Leu Leu Pro 545 550 555 560 Pro Pro Pro Gln Asn Ser Val Pro His Tyr Ala Glu Ala Asp Ile Val 565 570 575 Thr Leu Gln Gly Val Thr Gly Gly Asn Thr Tyr Ala Val Pro Ala Pro 580 585 590 Pro Pro Gly Ala Val Gly Asp Gly Pro Pro Arg Val Asp Phe Pro Arg 595 600 605 Ser Arg Leu Arg Phe Lys Glu Lys Leu Gly Glu Gly Gln Phe Gly Glu 610 615 620 Val His Leu Cys Glu Val Asp Ser Pro Gln Asp Leu Val Ser Leu Asp 625 630 635 640 Phe Pro Leu Asn Val Arg Lys Gly His Pro Leu Leu Val Ala Val Lys 645 650 655 Ile Leu Arg Pro Asp Ala Thr Lys Asn Ala Arg Asn Asp Phe Leu Lys 660 665 670 Glu Val Lys Ile Met Ser Arg Leu Lys Asp Pro Asn Ile Ile Arg Leu 675 680 685 Leu Gly Val Cys Val Gln Asp Asp Pro Leu Cys Met Ile Thr Asp Tyr 690 695 700 Met Glu Asn Gly Asp Leu Asn Gln Phe Leu Ser Ala His Gln Leu Glu 705 710 715 720 Asp Lys Ala Ala Glu Gly Ala Pro Gly Asp Gly Gln Ala Ala Gln Gly 725 730 735 Pro Thr Ile Ser Tyr Pro Met Leu Leu His Val Ala Ala Gln Ile Ala 740 745 750 Ser Gly Met Arg Tyr Leu Ala Thr Leu Asn Phe Val His Arg Asp Leu 755 760 765 Ala Thr Arg Asn Cys Leu Val Gly Glu Asn Phe Thr Ile Lys Ile Ala 770 775 780 Asp Phe Gly Met Ser Arg Asn Leu Tyr Ala Gly Asp Tyr Tyr Arg Val 785 790 795 800 Gln Gly Arg Ala Val Leu Pro Ile Arg Trp Met Ala Trp Glu Cys Ile 805 810 815 Leu Met Gly Lys Phe Thr Thr Ala Ser Asp Val Trp Ala Phe Gly Val 820 825 830 Thr Leu Trp Glu Val Leu Met Leu Cys Arg Ala Gln Pro Phe Gly Gln 835 840 845 Leu Thr Asp Glu Gln Val Ile Glu Asn Ala Gly Glu Phe Phe Arg Asp 850 855 860 Gln Gly Arg Gln Val Tyr Leu Ser Arg Pro Pro Ala Cys Pro Gln Gly 865 870 875 880 Leu Tyr Glu Leu Met Leu Arg Cys Trp Ser Arg Glu Ser Glu Gln Arg 885 890 895 Pro Pro Phe Ser Gln Leu His Arg Phe Leu Ala Glu Asp Ala Leu Asn 900 905 910 Thr Val 3 24 DNA Artificial Sequence Description of Artificial Sequenceantisense oligonucleotide 3 gttscgrgcr gccagrtcsc grtg 24 4 4 PRT Artificial Sequence Description of Artificial Sequencesequence found in the cytoplasmic juxtamembrane sequences of several plasma membrane receptors 4 Asn Pro Xaa Tyr 1 5 4 PRT Artificial Sequence Description of Artificial Sequencetight turn recognition motif for internalization in coated pits 5 Asn Pro Ala Tyr 1 6 8 PRT Artificial Sequence Description of Artificial Sequenceamino acid sequence found in many tyrosine kinases corresponding to the antisense oligonucleotide in SEQ ID NO3 6 His Arg Asp Leu Ala Ala Arg Asn 1 5 

What is claimed is:
 1. A composition of matter comprising a polypeptide having a first domain with carbohydrate binding activity and a second domain with kinase activity.
 2. A polypeptide according to claim 1, wherein said polypeptide further comprises a third transmembrane domain connecting said first and said second domains.
 3. A polypeptide according to claim 1, wherein said kinase activity is tyrosine kinase activity.
 4. A composition of matter comprising a polypeptide having a first domain with discoidin-type ligand binding characteristics and a second domain with tyrosine kinase activity.
 5. A polypeptide according to claim 4, wherein said polypeptide comprises essentially a sequence of amino acids shown in FIG. 1 (SEQ ID NO: 2) or a portion thereof which retains said first domain with discoidin-type ligand binding characteristics and said second domain with tyrosine kinase activity.
 6. A composition of matter comprising a polypeptide having a domain with discoidin-type ligand binding characteristics, wherein said polypeptide comprises essentially a sequence of amino acids shown in FIG. 1 (SEQ ID NO: 2) or a portion thereof which retains said domain with discoidin-type ligand binding characteristics.
 7. A composition of matter comprising a polypeptide having a first domain with discoidin-type ligand binding characteristics, a second domain with tyrosine kinase activity, and a third transmembrane domain that connects said first and said second domains.
 8. A polypeptide according to claim 7, wherein said polypeptide comprises essentially a sequence of amino acids shown in FIG. 1 (SEQ ID NO: 2) or portion thereof which retains said first, second, and third domains.
 9. A polypeptide according to claim 7, wherein said polypeptide comprises a sequence of amino acids encoded by essentially a sequence of nucleotide bases assigned Genbank accession number L11315 or a portion thereof which retains said first, second, and third domains or a portion thereof which retains said first, second, and third domains.
 10. A composition of matter comprising a polypeptide having a sequence of amino acids shown in FIG. 1 (SEQ ID NO: 2) or a portion thereof sufficient to provide a antibody or equivalent molecule binding site in said polypeptide.
 11. A polypeptide according to claim 10, wherein said antibody or equivalent molecule binding site is proximate to or in a discoidin-type binding domain in said polypeptide.
 12. A composition of matter comprising a nucleic acid encoding a polypeptide having a first domain with carbohydrate binding activity and a second domain with kinase activity.
 13. A nucleic acid according to claim 12, wherein said kinase activity is tyrosine kinase activity.
 14. A nucleic acid according to claim 12, wherein said nucleic acid encodes in said first polypeptide domain a sequence of amino acids having discoidin-type ligand binding characteristics.
 15. A nucleic acid according to claim 12, wherein said nucleic acid comprises essentially the sequence of nucleotide bases shown in FIG. 1 (SEQ ID: 1) or a portion thereof.
 16. A nucleic acid according to claim 12, wherein said nucleic acid comprises essentially a sequence of nucleotide bases assigned Genbank accession number L11315 or a portion thereof.
 17. A composition of matter comprising essentially a sequence of nucleotide bases shown in FIG. 1 (SEQ ID NO: 1) or a portion thereof that encodes a polypeptide having a first discoidin-type binding domain.
 18. A sequence of nucleotide bases according to claim 17, wherein said sequence further encodes said polypeptide having a second domain with kinase activity.
 19. A sequence of nucleotide bases according to claim 18, wherein said kinase activity is tyrosine kinase activity.
 20. A sequence of nucleotide bases according to claim 18, wherein said sequence further encodes said polypeptide having a third transmembrane domain that connects said first and said second domains.
 21. A diagnostic or prognostic method for detecting the presence of a polypeptide that is produced in tumors of epithelial type cells and therefore an indicia for the presence of such tumors, comprising the steps of: a) producing an antibody equivalent molecule with binding specificity directed to a polypeptide having a sequence of amino acids shown in FIG. 1 (SEQ ID NO: 2) or a portion thereof sufficient to provide said antibody or equivalent molecule a binding site in said polypeptide; b) reacting said produced antibody or equivalent molecule having said polypeptide binding specificity with a test sample that possible contains said polypeptide; and c) analyzing said reacted test sample for binding of said antibody or equivalent molecule to said polypeptide, thereby indicating the presence of said polypeptide.
 22. A method according to claim 21, wherein said test sample comprises a fluid from said test subject.
 23. A method according to claim 21, wherein said test sample comprises blood or a fraction obtained from blood.
 24. A method of producing a therapeutic agent for treating or influencing tumors of epithelial type cells in a patient, comprising the steps of: a) isolating a ligand which binds to a discoidin-type binding domain of a polypeptide having a sequence of amino acids shown in FIG. 1 (SEQ ID NO: 2) or a portion thereof; b) utilizing said isolated ligand to produce a therapeutic agent that binds to said discoidin-type binding domain; and c) providing a quantity of said therapeutic agent sufficient to influence said tumor's behavior in the patient.
 25. A method of immunotherapy for treating a patient with tumors of epithelial type cells in which the epithelial cells present a characteristic membrane bound polypeptide comprising administering to the patient a sufficient quantity of said polypeptide to produce a suitable immunological response against said tumor cells, wherein said polypeptide has a sequence of amino acids shown in FIG. 1 (SEQ ID NO: 2) or a portion thereof sufficient to generate said response.
 26. A method of inhibiting in a patient the growth of tumors of epithelial type cells in which the epithelial cells present on their surfaces a characteristic membrane bound polypeptide that influences the cells' growth by binding a specific regulatory ligand comprising administering to the patient a quantity of said polypeptide sufficient to bind to and limit the amount of available said ligand to produce said inhibition, wherein said polypeptide has a sequence of amino acids shown in FIG. 1 (SEQ ID NO: 2) or a portion thereof. 